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ABSTRACT 


The Naval Personnel Pay Predictor (enlisted Model) is 
used by the Bureau of Naval Personnel as a tool for pre- 
dicting the total annual basic pay for the enlisted force 
aS an input to the budget process. A major source of error 
in the model was found to be the prediction of the length 
of service (LOS) vector, and an attempt to improve this 
prediction was made. The extreme complexity of the model 
was found to be unnecessary, and a simple exponential 
smoothing subroutine for LOS prediction did as well or 
better than the original model. It was also found that a 
double exponential smoothing subroutine, taking into account 
the trends in the force structure, would almost uniformly 


improve the one year prediction from the model. 
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TNO CLURE Chetan PROBLEM 


The Bureau of Naval Personnel requires many mathematical 
models for accurately predicting the structure of the future 
force. These models are used as tools to aid in planning 
decisions. Of special interest is the problem of costing 
the future force as a part of the budget submission 
procedure. 

The Bureau of Naval Operations determines the personnel 
requirements for the future force and passes these to the 
Bureau of Personnel for implementation. These requirements 
are presented in the form of quarterly pay grade vectors, 
that is, the number of people required in pay grade E-l, 
E-2, E-3, ..-, E-9. Since the amount of pay received is 
dependent on the member's length of service, the problem of 
predicting the total cost of the force becomes complex. 

The specific problem considered was: given the future 
size of the force by pay grade, the past and present inven- 
tories, predict the total annual base pay of the force for 


future years. 





Pie coe APPE MODEL 


One model currently used for this purpose is the Naval 
Personnel Pay Predictor (Enlisted Model), referred to as 
NAPPE. The model makes use of a data base consisting of 
three sets of quarterly inventories (pay grade by LOS Force 
matrices) for all years since 1957. The inventories are for 
United States Navy (USN), United States Naval Reserve (USNR), 
and Total Navy (TOTALNAV), the sum of the two. 

The procedure is to, first, predict the future quarterly 
LOS vectors for the desired number of years into the future 
(up to ten). This is a vector of the total number of people 
with length of service l, 2, ..., 31 years. The methodology 
used for this prediction is discussed later in this section 
and in Appendices A and B. The LOS vector is then combined 
with the pay grade requirements vector to get the predicted 
force matrix. A discussion of this procedure is also included 
in this section. The cost of the force is then simply the 
multiplication of the straight line averaged (between suc- 
cesSive quarters) number of people in each cell of the matrix 
with the pay scale for that cell, which is an input to the 


model. 


Pe soMOOTHIING THE LOS VECTORS OF THE INVENTORIES 
The first step in the NAPPE models prediction of the LOS 


vector is accomplished by a subroutine referred to as SMOOTH 





(vefer™to Appendix Awfor the mathematical model). Throughout 
the Eiceroston of SMOOTH it should be remembered that all 
calculations applied to previous years data are made inde- 
pendently for the population (the total number of people) in 
each element of the LOS vector (hence referred to as LOS cell) 
and the transition rates from one cell to the next, computed 
for all three data bases. The transition rate is simply the 
proportion of the population in cell 1 of year j which move 
to cell itl in year j+tl. The methodology is basic single 
exponential smoothing as discussed by Brown [1] and others. 

The following procedure is done independently for each 
LOS cell. For each year of historical data, a prediction is 
made based on exponentially smoothing the data up to that 
year uSing values of the smoothing constant (hence referred 
memaSeatpna) Of -05, .10, ..., .95. For each year, the pre- 
dicted value is then compared with the actual value to deter- 
mine which value of alpha would have given the best prediction. 
This results in the selection of an alpha for each LOS cell 
for each year of data. Consult Appendix A for the exact 
procedure and forms of the resulting error that are stored 
and used by the model. "Best" predictions and resulting 
errors are made for all years of historical data, finally 
resulting in a decision for the "best" alpha for predicting 
the future. 

The output of SMOOTH consists of four sets of LOS vectors. 


For each year of historical data, there is a prediction based 





on transition rates and a prediction based on previous 
year's cell populations, one pair based on the TOTALNAV data 
and the other pair on the sum of the predictions based on 
USN and USNR data. Note that due to the difference in the 
structure of the USN and USNR, the sum of these predictions 
may be different than the prediction based on their sum, 

The final prediction is made by a subroutine referred 
to as ADJSMO (refer to Appendix B for the model). ADJSMO 
considers five "methods" for prediction. These include the 
four outputs from SMOOTH plus a weighted average of these 
predictions. This weighted average is formed by multiplying 
a weight (BWT) times the average of the two transition rate 
based predictions plus the complementary weight (1.0 - BWT) 
times the average of ae two population based predictions. 
foes calculation is made for values of BWT of .45, .50, .e.e, 
95. For each year of data a "best" method (of the five), 
in the least square error sense, and a "best" weight, if a 
weighted average method was chosen, is selected for predicting 
that year. The absolute sum of the errors of the "best" 
prediction is also calculated for use in adjusting the final 
prediction. This adjustment is necessary because no transition 
rate if available to predict LOS cell l. 

Having selected a "best" method and a "best" weighting 
factor based on the last year of historical data, the model 
pecolctcutmae Llrste Luture year values for LOS cells 2-31. 


At this point the model calculates the average (over all 
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years of data) proportion of the total population that was 
mmuos cell ll Ihis proportion is then applied to the total 
force required for the quarter under consideration and com- 
pared with the number which would be in cell 1 given the 
predicted values for cells 2-31 and the required total. 

Half of the difference in these two values is then allocated 
among cells 2-31 according to the total absolute error dis- 
cussed above. The prediction for cell 1 is then the differ- 
ence between the required total size of the force and the 


predictions made for cells 2-31. 


B. GENERATING THE PAY GRADE BY LOS MATRIX 

The pay grade by LOS matrix is calculated using a method 
for renormalizing contingency tables, as described by Mosteller 
[Ref. 3]. This method is an iterative procedure which takes 
the desired marginal totals of a matrix and a given, or base, 
matrix of the desired form and constructs a matrix as similar 
as possible to the base matrix, having the marginal totals 
that were desired. 

Since this method was used throughout the research, a 


brief discussion of the procedure follows: 


Let A. . be the elements of the base matrix, i = 1,2,...,9 
ae , = 1,2 31 
| é yes oF 
Let Ba be the desired row totals 
Let Cc. be the desired column totals 
9 
(GRR = DA. .. Boreal lt. 
J al 1,) 


Toe 





pea = Be for all j 


) 
= Dien. .« Ss ae a 
1,J eet; 3) 
Soil 
Cir es A! ., for all il 
a jay iv) 
De = Cyc! fom all i 
56 Bi Ze 
weer — De AL for all i,j 
35 J) + 1,J 
he oS A 
1, J) disp) 


Return to step (1). 


The procedure is continued until the row and column totals 
converge to the desired totals. 

In the NAPPE model, the marginals are the given pay 
grade vector and the predicted LOS vector. The base matrix 
is calculated as the simple average of the last twelve 


quarterly historic inventories. 


de 








Pee week MEN oewitn AND CHANGES TO THE NAPPE MODEL 


Since the object of the research was to improve the 
predictive accuracy of the NAPPE model, the sources of error 
had to be determined. It appeared that there were two inde- 
pendent sources of error, predicting the LOS vector and the 
instability of the Mosteller procedure for completing the 
matrix. Both of these possible problem areas were studied. 
In this section is a discussion of the first area and changes 
which were made to the model to improve its predictive quality. 
This is followed by a discussion of a study to discover the 


factors which influence the Mosteller procedure. 


A. MAJOR SOURCE OF ERROR 

The removal of either of these above mentioned sources 
of error should improve the predictions of the model. An ad 
hoc test of this hypothesis was accomplished by using the 
Mosteller subroutine (called PNGPNG) with the correct LOS 
vector and comparing the results of the model with known 
values, for years with historical data available. 

The NAPPE model has a validation feature which facilitates 
this and other kinds of comparisons. AS an input to the 
model, the last date of historical data to be used is given. 
The model then only looks at data up to that date and predicts 
as if that were today's date. Also included in the NAPPE 


package (which consists of several minor models besides 
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NAPPE itself) is a model called NAPVAL. This model compares 
the NAPPE output with the actual inventories. These compari- 
sons are.discussed throughout this paper. Specifically, any 
number called "actual" will mean an output from NAPVAL. Also, 
throughout the paper, the measure of effectiveness for com- 
parison will be the total annual cost of the force, which 
is an output of both models. 

In order to accomplish the above mentioned objective, 
the SMOOTH and ADJSMC subroutines were removed from the model. 
In their place, the actual LOS vector was read from the inven- 
tories and the following table is the result of comparing 
the prediction based on this procedure and the prediction 
Of NAPPE. The elements of the table are the actual cost 
(NAPVAL), the NAPPE prediction (with the model untouched), 
and the prediction using only the Mosteller procedure (NAPPE 


with SMOOTH and ADJSMO removed), labeled "USing Actual LOS". 


TABLE I 


COMPARISON OF NAPPE WITH PURE MOSTELLER 
(costs are in millions of dollars) 


Actual NAPPE 3 Using % 
Year Cost Prediction dy ge a(@)ng Actual Los Bieta ta 
1968 bg ee LSS, 13.63 ieee .014 
1369 ZO 09 De (Qe < oun 270820 nOGH 
1970 PE PEND 22 OG Zo 2) DASE .O8l 
or 1 2,264 2,258 ,2o0 DE IANE OZ 
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These results indicated that a major source of error in the 
model, and hence a potential for improvement, results from 
the prediction of the LOS vector, as was expected. If one 
looks at any feature of the enlisted force (Such as size or 
distribution), he finds that it is not stationary in time, 
even considering statistical fluctuations. There are obvious 
trends. During war years, the force becomes larger and, on 
the average, younger, while during peace time, the force 
becomes smaller and older. Since single exponential smoothing 
does not allow for trends, it could not be expected to handle 
the problem being considered. 

However, before attacking this problem, there were other 
questions to be answered. After documenting the model, two 
other questions came to mind. Is the pure complexity of the 
model worth the computer reguirements? (The following section 
indicates not.) Is the use of the entire data base justi- 
fied? Intuitively, the answer to the second question was no. 
The size and structure of the force in the late 1950's is 
not indicative of the force in 1975. There are continual 
policy changes which affect enlistment, promotion, and 


mBeLentrTon. 


B. SIMPLE EXPONENTIAL SMOOTHING 
In order to answer these questions, the first change in 
the model was made. The SMOOTH and ADJSMO subroutines were 


removed and a subroutine, SMOTHY, replaced them. This 
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subroutine used simple exponential smoothing of the transition 
rates and only the TOTALNAV data base. 

imeethneroMOTHY Subroutine 

This subroutine results in an extensive simplifica- 

tion of the NAPPE model as it uses only four years of 
historical data and a single alpha value of 0.4. This alpha 
value may seem very large, but it was desired to make the 
prediction extremely dependent on the most recent data, which 
is the most significant. The actual subroutine is included 
at the end of the paper but the simple mathematical model 
follows: 


be the number of people in LOS cell k 


met 7. 
alg e ® ® 
in quarter j of year 1 


j,k 
For the four years of historical data calculate the 


loss rate for each quarter and each LOS independently 


pa . i ik 7 fi41,5,k41 


The following series of data were smoothed for prediction 


eee 7 TR: T Jk TR. 


UR meet), Ayal ae , Vk 

For each LOS calculate the annual loss rate for prediction 

(P. i? using single exponential smoothing. The procedure 
if 

is to iterate through four quarters of data which results 


in a single value for each year. The superscript (j), 


J = 1,2,3,4, is used to indicate the intermediate steps of 
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the procedure but need not be carried once the annual 


predicted loss rate has been calculated. 


(le) - 
P. - aTR. + (1.0 a) Ps 


’ 1,1,k pus 


Where 


Peiy ie is the final result from the previous year. 
? 


aja mn (j=) a 
ee = oe sey elk + (1.0 a) Bae Oi vale —=23, 4 
_ (4) 
Piyk ~ Pilk 


Note that only one loss rate prediction is made for each 
year. This means that seasonal variations in the loss rate 
are not taken into account. Another approach would be to 
predict a loss rate separately for each quarter, the trade- 
off being that this procedure would require more years of 
data. This raises the question of whether using older data 
which takes into account seasonal variations would result 
in a better prediction than not using this older data but 
ignoring the seasonal variation. This is an area left for 
fueehner Study . 

The prediction is now made for each quarter. 


Let T. 


44K be the prediction for year 1 (first future 
f f 


year) quarter j and LOS k 


7 





Te «i a ee eal. Oo — 9. . ) Om ale: 
aL Aig Sl 9 is tee 1, oe 


As with the NAPPE model, this gives predictions for LOS 
cells 2-31. The calculation for cell 1 was done in a manner 
Similar to the existing NAPPE model. The average proportion 
of the total force in cell 1 was calculated for the four 
years of data. For each quarter the following calculations 
were made: 

Let C1lAV be the number which would be required in 
cell 1 calculated by taking the above proportion 
of the total force requirement for the quarter 
being predicted. 
on 
Let Cl1P = Req - ¢£ fT. . total required minus 
k=2 1/3'K the sum of cells 2-31 
Half of the difference between these two values was 
then allocated among cells 2-31 on the basis of the 
number projected for that cell. 


For each cell calculate 


T 


_ {ClAV - C1P) i,j,k 
Be 4, k 2 31 
oT. 
k=2 2rIrk 
eee = 7T + ADJ 
later i,j,k 7) Aus 


18 





The value for cell lis then the difference between the 


total required and the sum of the new predictions for cells 


Pe31. 


2 


Results of SMOTHY 


The model, as described above, was then run to 


obtain one year predictions for the last ten years. The 


following table is a comparison of these results with 


outputs from NAPPE for the same time periods. 


rear 
965 
1966 
NG? 
1968 
19169 
70 
JE)gal 
oy 2 
ILS 


1974 


TABLE II 


COMPARISON OF NAPPE AND SMOTHY PREDICTIONS 
(costs are in millions of dollars) 


Actual NAPPE S SMOTHY % 
Cost Prediction Error Prediction EELOr 
562.1 1,360.0 vod lps S5teiaal (AUS) 
1,582.4 Ihpeedbas sour JL ey oh) 028 
20.7 HE Sa SS hks NF ey Re, Sy 
i OS2.6 JE seksi Sey 1h 3) Sheiens, on 
2 SOLOIST Ps 27016. 7 372 2 Oro. 7 5 EY 
DOR S ets PE OAENS) (8 262 27280 .9 sOOL 
2,264.8 Dro ooae oO 2256 <2 Soy, 
VST rae | 2,480.7 643 2,483.6 524 
2,003.0 ZO Oe: W205 2,019.3 ES 
2. 4 2,1) 3560 «138 22k . 190 
Mean 5 ey; 20 Go 
Mean 2 Seal p2o ls 
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The value given in the table as mean is the mean of the 
absolute errors and the value called mean 2 is the same 
except the outlyer (1967) is left out of the calculation. 
Leaving this value out is not unreasonable when the events 
of 1967 are taken into consideration. It was during this 
year that the structure of the force saw tremendous change 
due to the Viet Nam buildup. Any model based on past data 
cannot predict the future when major policy decisions make 
that data inappropriate. This is a point where the analyst 
using the model must use reason when looking at its output, 
a point to be discussed later. 

A close inspection of the preceeding table yields 
some surprising conclusions. Although the SMOTHY model does 
not predict uniformly better, it does significantly better 
for most years. This suggests that the complexities of the 
NAPPE model are not only unnecessary, but have a negative 


effect. 


Cc. DOUBLE EXPONENTIAL SMOOTHING 

The most important hypothesis tested was that single 
exponential smoothing is not the appropriate tool for modeling 
a time series which appears to have trends. As suggested by 
Brown [1], Goodman [2], and others, higher-order exponential 
smoothing is a valuable tool for modeling time series with 
underlying trends. Since the time series under consideration 


does not show any properties which would indicate anything 
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beyond a linear trend, only double smoothing was considered 
ii oMOTH 2 . 
1. The SMOTH2 Subroutine 
As with the SMOTHY subroutine, the SMOTH2 subroutine 
makes use of only four years of historical data and only 
the TOTALNAV inventories. It also only smoothes the loss 


rates. These loss rates (TR. 


fae are calculated exactly 


the same as in SMOTHY, and the loss rate to be used for 


prediction (D, 1s calculated as described by Brown [1]. 


rk 
The single smoothed portion (P. :) is calculated exactly as 
before. The double smoothing term is calculated by simply 
smoothing the single smoothed value. The superscript nota- 


tion is again used for the four iterations exactly as used 


to calculate P. ; 
ak 


is: (1) 7 
Pik aa ote ate (1.0 a) °5-1,k 
a (3) ia (j=) — 
Siok = OP sk + (1.0 a) ace for all j Zor 
4) 
See ak 


These two values are then combined to pick up the 


lweat trend and result in; 
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Note that again only one value of the smoothed loss 
rate is calculated for each year and the same procedural 
question was left unanswered. 

The prediction is then made for each quarter into 


the future exactly as in the SMOTHY subroutine 


ea cei, wa to Psy! 
As with the previously discussed models, this gives a 
prediction for LOS cells 2-31. The final adjustments used 
in SMOTH2 are exactly the same as used in SMOTHY. 
2. Results of SMOTH2 

As in the experiment with SMOTHY, it was obvious 
that the most recent data should be most heavily weighted. 
Therefore, an initial value of 0.4 was used for alpha. 
Since the impact of trend was the most important considera- 
tion of the research, other values of alpha were also tried. 
The following table is the result of these tests. In order 
to put the results in a form for analysis, the actual dollar 
values were not tabled but only the percentage errors. The 
elements of the table are the percentage error from the actual 
total cost of the force for NAPPE, NAPPE with the SMOTHY 
subroutine (these values are the same as Table II), and for 
NAPPE with the SMOTH2 subroutine using values of alpha of 


Ome. >, and)0.4. 


eZ. 





PAB bs 


* COMPARISON OF ABSOLUTE PERCENTAGE ERRORS 
ORS eerE pool, AND SMOTH2 


SMOTH2 

Year NAPPE SMOTHY a=.2 a=.3 a=.4 
IGS alo ~209 ~242 223 Za AS: 
1966 7 OOH: .028 OHS nO 7 .018 
SIGH] eile moo7 7096 Alleys: 1 be a8) 
1968 - Jon ae OG .098 mea 3 
LSS oy 2 a2 2 agile wus .054 
1970 AS Iz ~001 5205 .076 ~041 
Is) 7/ de 2910 2579 ~443 ~423 »473 
nS 2 .643 2524 Fey) -o27 7543 
Ue ye P20 SoG ity dl -164 ~147 
1974 dees! 5 SIS _ hay .068 noe 
Mean > StI ey, 550685 ~3l45 SPATRE EG fe NSS 
Mean 2 = AOA S25 L8 ~2274 ileal -2026 
Mean 3 AMO AL mG) SUS .0984 73 


The values of mean and mean 2 have the same definition 
as in the preceeding table. The value mean 3 was calculated 
leaving out the values for 1967, 1971, and 1972. The reason 
for making this calculation will be discussed in detail later. 

The first overview of the table would result ina 
conclusion that SMOTH2, with an alpha value of 0.3 seems to 


be a somewhat better model on the basis of the mean alone. 
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However, the mean is not the only significant feature. The 
most important observation is that SMOTH2 predicts extremely 
well for all years except 1967, 1971, and 1972. The reason 
for the poor prediction in 1967 has already been discussed. 
The reason for the poor prediction for 1971 and 1972 can 
be explained in the same manner except that the force 
structure waS moving in the opposite direction. That is, 
these were the years of major policy changes resulting from 
the end of the Viet Nam War and the shrinking of the force. 
Because of the linear trend, which is a part of 
SMOTH2, it must be expected to do poorly when the direction 
of the trend changes. This means that SMOTH2 should have 
more difficulty "turning the corner" when there are major 
policy changes. This does not mean that it is a poor model, 
but rather, some decision rule is required of the user when 


EneS OCCULS. 


D. THE MATRIX GENERATION PROCEDURE 

Although incomplete and inconclusive, a study of the 
Mosteller method, as used in this model, had some interesting 
results. It was found that, in general, the Mosteller pro- 
cedure is extremely senSitive to the base matrix. It was 
found that changing the value in one cell of the base matrix 
resulted in changing the values in virtually every cell of 
the output matrix. There was no consistency found in these 
changes. The surprising result was that the changes in the 


output matrix were sometimes greater than the initial changes 
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in the base matrix. For example, changing one cell value in 
the base matrix by less than 5% could result in changes in 
the output matrix of greater than 5% in some cells. 

This is extremely significant when considering its use 
in this model. The base matrix is calculated as a simple 
average of the last twelve quarterly inventories. This means 
that seasonal variations in the force structure are not 
taken into account and implies that a better base matrix 
may be possible. 

A hypothesis was made that a better base matrix could 
be calculated by computing some very rough transition rates 
from one cell to another and these rates applied to the one 
year previous inventory. Experiments with this hypothesis 
showed some very promising results but were inconclusive. 


Continued study in this area may be of considerable value. 
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iV. CONCLUSIONS 


The NAPPE model, in its present form of single exponen- 
tial smoothing, does not appear to be the appropriate model. 
Single exponential smoothing has as an assumption that the 
time series is baSically constant in time and the difference 
from the mean is caused by some random noise. This does 
not appear to be the case with LOS populations or with 
transition rates. 

A recommended change to the model is to remove the 
complex SMOOTH and ADJSMO subroutines and replace them with 
the SMOTH2 subroutine, uSing an alpha of 0.3. It should be 
made clear to any intended user, however, that substantial 
changes in the enlisted force management would not be 
reflected in the prediction. The modelling approach should, 
in fact, be completely revised so that changes of this magni- 
tude can be accounted for. Since pay grade totals are used 
to drive the force structure, the model is aware of impending 
changes in direction. This information is not currently 
being used in loss prediction by NAPPE. 

In addition, the base matrix used in the Mosteller pro- 
cedure could be eStimated more carefully. Based on these 
preliminary experiments, this could result in a much better 
estimate of force structure, and hence a more accurate budget 
prediction. The determination of LOS cell 1 population re- 


mains somewhat ad hoc, as does the choice of a smoothing 
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constant of 0.3. While this study has demonstrated that 
a Simpler approach to the prediction can be successfully 
taken, all other alternatives have not been thoroughly 


investigated. 
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APPENDIX A 


This appendix is a rough documentation of the SMOOTH 


Subroutine in the current NAPPE model. 


Let 
A; 1k be the actual population for year i 
Je quarter j and LOS cell k 
Pom each LOS k = 1, 2, ..., 30, calculate the transition rate 


for each year and quarter 


Pee kee itl, 5 kd 
| Ae 
t iii aiups ie 
Por each year 1 = 1, 2, ..., NYR-1l (NYR = last year of 
historical data) and each a = .05, .10, ..., .95, calculate: 
1, 
Pp a 2 eal 
Let n = 41 + 3 ieee 
a - _ ul 
P << ad IE ap (CL Ae, a) P - 


For each year then the predicted transition rate is 


i+2,0 © 4i+t1 
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The relative error in this prediction is then 
the sum of the differences between the predicted 
and actual transition rates for the year 


PISS iL ea es 


ER 
1 - eel 3 


4 
; = - 
Tear 7A! a 


Go to the INLINE subroutine to choose the best a. 


INLINE Subroutine 


For each year i= 1, 2, ..., NYR find the best qa for 


predicting the following year. 


Potweach g@ = .05, .10,7 ..., .95 calculate 
EMIN, = ER. 
1 clapee 
1 
EMIN, = 2% ER 
2 p=. 
z Z 
Bion = &) (ER...) 
5 al dlisa6 


Select the a which gives the minimum value of EMIN,, 


EMIN,, EMIN , and call them a‘, a5 a5. 
Calculate 
= 2 
SE, = . (ER) yx) 
a 


Z9 





ate 
SE, = (ER) . 

Cl f=) Within the summation 
here, each a* is the 
one which was selected 
for the given year 


Select the minimum of these three values and the 

x 
most recent a for that method is the a to be used 
to predict the following year. Call this value Oey 
For each year i=1, 2, ..., NYR and each quarter of the 


year, make the prediction based on the transition rate: 


AUR = A 


eee eer 67° = PREDS y! 


eet 


Now make a similar prediction based on population in each 
cell, 


For each year i = 2, 3, ..., NYR and each a = .05, .10, 


het = 41+) ## j= 1, 2, 3, 4 


iL ; 1 
Ss See et + (1.0 a) P n 


Bie 





oll 
aeRO © Ad+) 


The relative error in this prediction is then 


ill 
4 gh Teta Bese Aaa 
j=1 Ait1,4,kt1 


4 


ISS = 


Go to the INLINE subroutine to choose the best a. For 
each year 1 = 2, 3, ..., NYR and each quarter (the same 
value is predicted for all four quarters of a given year), 


make the prediction based on cell populations 
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APPENDIX B 


This appendix is a rough documentation of the ADJSMO 
subroutine in the current NAPPE model. 
Define the five methods or techniques used in the 
subroutine: 
me Pt). ..= T. . this is the predicted value for 
i,j 1,3, 
year 1, quarter j calculated as 
the transition rate based 


prediction from SMOOTH using 


the TOTALNAV data. 


2. P(2); ; dve this is the sum of the predictions 
v 
from SMOOTH made as above using 


USN and USNR data. 


3. eps = P this 1s the predicted value for 
year i, quarter j calculated as 
the population based prediction 
from SMOOTH using the TOTALNAV 


data. 


4. P(4). . =P. this is the sum of the predictions 
> Dara | ar 
from SMOOTH made as above using 


USN and USNR data. 
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a Pee = BWT ait) Pter = (iLsO) = jean ES) - Ee 


this is a weighted average of 
1-4 where the value of BWT is 
the value which would have 
predicted best for the previous 


year. 


Pomme ach LOS k = 2, 3, .w-«e+«, 31 


For each year 1= 1, 2, ..., NYR calculate the cumulative 
square error for each method I =1l, 2, 3, 4, 5. 
i 4 A. . - P(TI) eZ 
TEP (I), = 4 r (ed dy 
n=l j=l ee hy 


For each year calculate the cumulative Square error 


fem sall values of BWT = .45, .50, .«.«, «95 


i 4 A, .,- X, .(BWT) 2 
ET2(BWT), = EZ 2 (a 
n=l j=l liv aig 


where 


Pil) weet P (2). . 
X. .(BWT) = BWT i or ee 
1, J 2 


eee nt). 
comer On BWr) +2 +4 


ge 





Based on TEP select the best method, I, and based on 
ET2 select the best BWT which will then be used to calculate 
P(5) for the following year. 

Calculate the cumulative absolute error for the entire 
period uSing the method which was selected as best for each 


year. 


ee Gee 
TER, = 5 OE fede sd | where I, 
ee A. . - 
| iupk 


was the best 
method for that 


year. 


Make the initial prediction for the first future year 


sme Sa aE ) 


NYR+1,3 where Love 1s 


the method 


ANYR+1,3,k+1 NYR 


selected best 
on the last 


year of data. 


Let IPG, : be the total force required 
i 
Calculate the average proportion of the total 
force in cell 1 for all years of data, call it PAV 


For each quarter to be predicted, calculate 


a 





C1lAV = (PAV) (IPG) 
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Se oe a AuyR+t1,4,k 


k 
These two values are the possible predictions for 
LOS 1. Cl1AV is based on the average proportion of the force 
in cell 1, while ClP is simply the difference between the 
total required force and the predictions for cells 2-31. 


Let 


CVAY +o Clp 


CLADJ = 5 


There is a test in the model to ensure that this average 
is between the values which would have been calculated 
using the largest and smallest proportions of the total 
population in cell 1 over the entire data base. 

Take the difference between C1lADJ and Cl1P and allocate sO 
among cells 2-31 according to the total error which was 


calculated for predicting that cell using the best method. 


f (C1P - C1ADJ)TER, _jA, 5. 


: k 
See ak. SL 


= TE A. 
meee i,j,k 


Adjust cell 1 from these values 


oie. 





k 3l x 
A. . = £[PG, . - 2 A. . 
oe) pect alae) k=1 iy oey i 


The same basic procedure is used for predicting additional 


future years (up to 10) and the value of continuing the 


discussion is questionable. 
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